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Abstract 

The complexity of a particular term-rewrite system is considered: the rule of associativity 
(x * y) * z > X* {y* z). Algorithms and exact calculations are given for the longest and shortest 
sequences of applications of > that result in normal form (NF). The shortest NF sequence for a 
term x is always n — dj^mix), where n is the number of occurrences of * in x and dj:m{x) is the 
depth of the rightmost leaf of x. The longest NF sequence for any term is of length n{n — l)/2. 



1 Preliminaries 

( |Klop 1992| ) provid es an overview of the theory of term rewrite systems. There is relatively 



little known about the complexity of various term rewrite systems. Here, I consider a particular 
rewrite system with one binary connective * and one rewrite rule {x * y) * z t> x * {y * z). 

A rewrite system — > is Strongly Normalizing (SN) iff every sequence of applications of — > 
is finite. A rewrite system is Church-Rosser (CR) just in case 

Vx,y.(x « — » y D 3z.{x — » z A y — » z)) 

A rewrite system is Weakly Church-Rosser (WCR) just in case 

yx,y,w.{w — >x A w — > y) D 3z.{x — »z A y — » z) 

Let — > be the relation between two terms such that x — > y just in case x contains a 
subterm, the redex, which matches the left hand side of the rule t> , and replacing the redex by 
the corresponding right hand side, the contractum, yields the new term y. A term is in normal 
form (NF) if it contains no redex. Let < — be the converse of — >. Let < — > be — > U < — . Let 
— » be the reflexive transitive closure of — > and similarly, « — the reflexive transitive closure of 
< — , and « — » the reflexive transitive closure of < — >. Note that « — » is an equivalence relation. 

Given a term x, define \{x) (resp. p{x)) refers to its the left (right) child of x. 

*This paper appears as Technical Report LCL 94-6 at the Computer Science Department of the Technion - 
Isr ael Institute of T echnology. It is also electronically archived in the Computation and Language E-Print Archive 
as |cmp-lg/9406034 



2 Longest rewrite sequence 



Given a term x, #x and ax are defined as follows: 

#x = 



if x is a leaf node 

1 + + i^p{^) otherwise 



ax 



if a; is a leaf node 

aX{x) + ap{x) + #A(x) otherwise 



Note that #x is the number of internal nodes in x. By convention, n is Note also that 
if x\>x' then x' = X{X{x)) * (p(A(x)) * p(a;)) and ax' = ax — {#X{X{x)) + 1). 

Lemma 1 — > is SN. 

proof Every term x is assigned a positive integer measure ax. An application of — >■ is 
guaranteed to lower the measure. This follows from monotonic dependency of ax upon the c's 
of each of x's subterm, and from the fact that locally, applying i> lowers a . □ 

Theorem 1 For every term x, there exists a NF-yielding sequence of ax applications of — 
furthermore, this is the longest possible NF sequence for x. 

proof The sequence of constructed by induction on ax: 

Base case: ax = 0. For every subterm y of x, #A(y) = 0, i.e. A(y) is a leaf. So x is in NF. 

Induction: I show that for every term x such that a{x) > there exists another term x' such 
that X — > x' and ax' = ax — 1. Let y be the deepest leftmost descendant of x such that y is a 
redox. Note that A(A(y)) is a leaf (otherwise X{y) would be a deeper leftmost descendant redex). 
Let y' such that y>y'. So = ay' = ay — 1 and by the straightforward dependency of 
ax' on the a's of each of x"s subterms, in particular y', ax' = ax — 1. 

The maximality of the length of the rewrite sequence follows from the fact that the applica- 
tions of — ^ decrease a by the minimum amount possible, 1. □ 

Corollary 1 For every term x, every sequence of applications of — > is of length at most 
n{n - l)/2. 

proof It suffices to show that for every term x, ax < n{n — l)/2. By induction on n: 
Base case: n = 1, ax = 0. 

Induction: Suppose true for all terms x' such that #a;' < n. Let m = #X{x). So < m < 
n — 1 and i^p{x) = n — m — 1. 

ax-n(n-l)/2 = aX{x) + ap{x) + #X{x) - n{n - 1) /2 

m,(m — 1) (n — m — l)(n — m — 2) n(n—l) 

^ ^T-^ + ^ 1 ^ + — 

= (m + l)(m — (n — 1)) 

< recalling that < m < n — 1 

□ 
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Corollary 2 There exists a term x that can be rewritten to NF by a sequence of exactly 
n{n — l)/2 applications of — >. 

proof 

An n-left-chain is defined as follows: A 0-left-chain is a leaf. An n-left-chain is an 
(n — 1)— left-chain * a leaf. Let x be an n-left-chain. ^x = n. I show by induction on n that 
ax = n{n — 1) /2: 

Base case: n = 1, ax = 0. 

Induction: Suppose true for an (n — 1)— left-chain. 

ax = aX{x) + #A(x) 

= {n-l){n-2)/2 + n-l 
= n(n-l)/2 

□ 

3 Shortest rewrite sequence 

I now show that a NF of a term (in fact the NF) can be computed in linear time. 

Lemma 2 — > is WCR. 

proof 

Let w he a term with two distinct redexes x and y, yielding the two distinct terms w' and 
w" respectively. There are a few possibilities: (without loss of generality, suppose x is not a 
subterm of y.) 

case 1: y is either not a subterm of x or it is a subterm of A(x) or a subterm of p{x) or it is 
p{x). In each case is clear that the order of application of — > makes no difference. 

case 2: y = X{x). For convenience let x = {{a*b) *c)*d. Applying i> at x gives (a*5) * {c*d); 
applying > at y gives {a* {b* c)) * d. The former can be rewritten to a* (b* {c* d)) using one 
application of > , and the latter is rewritten first to a * {{b * c) * d) which is then rewritten to 
a * (6 * (c * d)). 

□ 

Lemma 3 (Newman) WCR A SN D CR. 

Lemma 4 CR A SN D {\/x,y.{x « — » y A x,y are NFs ) D x = y). 
Theorem 2 — > NFs are unique. 

proof Follows from lemmas 0, § ^, and ^ □ 
Therefore any deterministic computational path of applying — > will lead to the NF. I now 
give an algorithm ctr for computing NFs. It applies i> as close as possible to the root of its 
argument. 
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ctri(x) first version 

1. y:=x 

2. while y is not a leaf and X{y) is a leaf 

3. y := p{y) 

4. if y is not a leaf 

5. then apply t> to y 

Lemma 5 The depth of the rightmost leaf of x is n iff a; is a NF 

proof X must be an n-right-chain — the mirror of an n-left-chain. □ 
Lemma 6 If drm{x) < n, algorithm ctri increases drm{x) by 1. 

proof algorithm ctri scans down the path from the root to the rightmost leaf, stopping at a 
redex y. By applying i> , it pushes everything in p{y) (including the rightmost leaf) one arc 
further away from the root. □ 
So iterating ctri n—diraix) times computes the NF. This process is inefficient, as it needlessly 
rescans the prefix of its argument. The following algorithm avoids this inefficiency. 

ctr(x) Gnal version 

1. y:=x 

2. while y is not a leaf 

3. while y is not a leaf and X{y) is a leaf 

4. y := p{y) 

5. if y is not a leaf 

6. then apply > to y 

Theorem 3 Given term x, ctr(x) computes a NF for x in n — drm.{x) applications of — 
proof Clearly, ctr gives the same result as ctri run n — dxm.{x) times, that is, the NF of x. □ 

4 Application 

Hepple and Morrill (1989) proposed using normal forms for overcoming certain difficulties with 
the parsing of Combinatory Categorial Grammar, a formalism for natural language syntax. The 
results above have been incorporated into an efficient parsing algorithm (Niv 1993, 1994). 
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